Problems related to 
a de Bruijn - Erdos theorem 



Xiaomin Chen 

Department of Computer Science, 
Rutgers University, Piscataway, NJ 08854-8019, USA 

Vasek Chvatal 1 

Canada Research Chair in Combinatorial Optimization, 
Department of Computer Science and Software Engineering, 
Concordia University, Montreal, Quebec H3G 1M8, Canada 



In memory of Leo Khachiyan 



Abstract 

De Bruijn and Erdos proved that every noncollinear set of n points in the plane 
determines at least n distinct lines. We suggest a possible generalization of this 
theorem in the framework of metric spaces and provide partial results on related 
extremal combinatorial problems. 
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1 Lines in metric spaces 



Two distinct theorems are referred to as "the de Bruijn - Erdos theorem". One 
of them [9] concerns the chromatic number of infinite graphs; the other [8] is 
our starting point: Every noncollinear set of n points in the plane determines 
at least n distinct lines. 
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search Chairs Program and from the Natural Sciences and Engineering Research 
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This theorem involves neither measurement of distances nor measurement of 
angles: the only notion employed here is incidence of points and lines. Such 
theorems are a part of ordered geometry [7], which is built around the ternary 
relation of betweenness: point y is said to lie between points x and z if y is an 
interior point of the line segment with endpoints x and z. It is customary to 
write [xyz] for the statement that y lies between x and z. In this notation, a 
line v/U is defined - - for any two distinct points u and v - - as 

{p : [pm>]} U {u} U {p : [wpu]} U {v} U {p : [iii>p]}. (1) 

In terms of the Euclidean metric p, we have 

[abc] 4=> a,b,c are three distinct points and p(a, b) + p(6, c) = p(a, c). (2) 

For an arbitrary metric space, equivalence (2) defines the ternary relation of 
metric betweenness introduced in [13] and further studied in [2,4,6]; in turn, 
(1) defines the line uv for any two distinct points u and v in the metric space. 
The resulting family of lines may have strange properties. For instance, a line 
can be a proper subset of another: in the metric space with points u, v, x, y, z 
and 

p[u, v) = p(v, x) = p(x, y) = p(y, z) = p(z, u) = 1, 
p(u, x) = p(v, y) = p(x, z) = p(y, u) = p(z, v) = 2, 

we have 

vy = {v,x, y} and xy = {v , x, y, z}. 

Nevertheless, fragments of ordered geometry might translate to the framework 
of metric spaces. In particular, we know of no counterexample to the de Bruijn 
- Erdos theorem in this framework. 

Question 1 True or false? Every finite metric space (X,p) where no line 
consists of the entire ground set X determines at least \X\ distinct lines. 



2 Lines in hypergraphs 

A hypergraph is an ordered pair (X, H) such that X is a set and if is a family 
of subsets of X; elements of X are the vertices of the hypergraph and members 
of H are its edges. Our definition of lines in a metric space (X, p) depends 
only on the hypergraph (X,H(p)) where 

H(p) = {{a,b,c} : [abc]} : 
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the line uv equals {u, v } U {p : {u, v,p} G H(p)}. This observation leads us to 
extend the notion of lines in metric spaces to a notion of lines in hypergraphs: 
given an arbitrary hypergraph (X, H) , we define the line uv ■ - for any two 
distinct vertices u and v — as {u, v } U {p : 3T (T G if, {w, i>,p} C T) }. Now 
every metric space (X, p) and its associated hypergraph (X,H(p)) define the 
same family of lines. 

A hypergraph is called k -uniform if each of its edges consists of k vertices. All 
the hypergraphs (X, H(p)) are 3-uniform, but some 3-uniform hypergraphs do 
not arise from any metric space (X, p) as (X, H(p)): it has been proved ([6,5]) 
that the hypergraph consisting of the seven vertices 0,1,2,3,4,5,6 and the 
seven edges 

{i mod 7, (i + 1) mod 7, (i + 3) mod 7} (i = 0, 1, 2, 3, 4, 5, 6) 

does not arise from any metric space. (This 3-uniform hypergraph is known 
as the Fano plane or the projective plane of order two.) Restricting the notion 
of lines to 3-uniform hypergraphs would bring about no loss of generality: for 
every hypergraph (X, H) there is a 3-uniform hypergraph (X, H^) such that 
(X,H) and (X,H^) define the same family of lines. Specifically, 

#( 3 ) = {S : \S\ = 3 and 3T (T G H, S C T) }. 

Let m(n, k) denote the smallest number of lines in a hypergraph on n vertices 
where every line consists of at most k vertices. Showing that m(n,n — 1) > n 
would show that the answer to Question 1 is "true" . However, as we are going 
to prove, m(n, n — 1) grows slower than every power of n. 

Lemma 2 If n, £, a are positive integers such that 2 < n — £ < a e , then 

m{n,n-l) < 2 l + la. 

PROOF. Write P — {1, 2, . . . , £} and let A be a set of size a. By assumption, 
there is a set S of strings of length £ over alphabet A such that \S\ = n — £ and 
such that, for each i in P, some two strings in S differ in their i-th position. 
For each choice of % in P and x in A, set 

E ix = {i} U {xix 2 . . .x e E S : Xi = x}. 

Now consider all the lines uv in the hypergraph 

{PUS, {P,S}u{E ix -.ieP,xe A}). 

If u, v G P, then uv = P. If u G P and v G S, then uv = E ux with x the w-th 
character in v. If u, v G S, then WJ = S U P' with P' the set of positions in 
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which u and v agree; P' is a proper (and possibly empty) subset of P. So the 
hypergraph has n vertices, none of its lines consists of all n vertices, and there 
are at most 1 + £a + (2 e — 1) lines. □ 



Theorem 3 There are positive constants n and c such that 

n > no =>- m(n, n — 1) < cr^*™. 

for all n. 



(3) 



PROOF. Let a,/3, 7, 5 be arbitrary constants such that 

0<a<l</9<7<2<5. 
There is a positive integer £ such that 

£>£ a £<£-l, (5 l <7*-l, £^<2 e , 2 i+1 < 5 £ . 
We claim that (3) holds as long as 



n > n =>- n — 



/Inn 

h/3 



> 2 



and 



In n > £\ In f3, In c > 



In 5 



To justify this claim, consider an arbitrary n such that n > n and set 



/Inn 
hT/3 



Now £ > £0, a > (3 e , and so £lna > £ 2 ln/3 > Inn. Lemma 2 guarantees that 

m(n,n- 1) < 2^ + £a; 



since 



we have 



□ 



£-11 /Inn 
a a V In f3 

V£a< 2 £+1 < 5 £ < c^. 



We do not know the order of growth of m(n,n — 1); our best lower bound is 
only logarithmic in n. (We follow the convention of letting lg stand for the 
logarithm to base 2.) 
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Theorem 4 m(n,n — 1) > lgn. 



PROOF. Consider an arbitrary hypergraph with n vertices and m lines where 
no line consists of all n vertices. Let us observe that 

for every two distinct vertices u and v, 

there is a line which includes u and does not include v : (4) 

by assumption, some vertex w is not included in line uv, and so no edge 
includes all three vertices u,v,w, and so line WW includes u and does not 
include v. For each vertex x, let S x denote the set of all lines that include x. 
Property (4) guarantees that these n sets are all distinct, and so n < 2 m . □ 



Actually, property (4) guarantees that the n sets S x form an antichain in 
the sense that none of them is a subset of another. This observation allows a 
negligible improvement of the bound in Theorem 4: first, the classic result of 
Sperner ([15]) asserts that an antichain on a ground set of size m has at most 



m 
>/2j. 



sets; next, by Stirling's formula, 



m 



U m /2J/ yVm/2 ' 
finally, if m = lg n + \ lg lg n + c, then 

om 

2 c (2/ 7 r) 1 / 2 n. 



' -Km l '2 

It follows that for every positive e there is an no such that 

1 1 7T 

n>n =>- m(n, n — 1) > lgn +- lg lgn + - lg — — e. 



Since m(n, k) is a nonincreasing function of k, Theorem 4 guarantees that 
m(n, k) > lgn whenever 2 < k < n. For small values of k, this bound can be 
much improved. 

Theorem 5 



whenever n > k > 2 
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PROOF. Consider an arbitrary hypergraph with n vertices and m lines where 
every line consists of at most k vertices. Trivially, 



for every two distinct vertices u and v, 

there is a line which includes both u and v. (5) 

Let P denote the set of all pairs (L, {u, v}) such that L is a line and u, v are 
two distinct vertices in L. On the one hand, every line includes at most k 
points, and so 

\P\ <mg). 
On the other hand, property (5) guarantees that 

m * (:) 

The lower bound on m follows by comparing the two bounds on \P\. □ 



When the value of k is fixed, the lower bound of Theorem 5 is asymptotically 
optimal: 

Theorem 6 

fc(fc-l) 

hm m(n,k)-—. = 1 

n^oo n(n - 1) 

whenever k > 2. 



PROOF. Theorem 5 guarantees that 

liminf m(n, k) ■ — > 1 . 

In every A;-uniform hypergraph (X, H) such that 

every two edges share at most one vertex, (6) 

each line is either an edge or a set of two vertices that is not a subset of any 
edge, and so there are 

,.,.(('?) - V Q 

lines altogether. In particular, with f(n, k) standing for the largest number of 
edges in a /c-uniform hypergraph with n vertices and with property (6), we 
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have 



m(n,k)<r \-f(n,k) U -1 ; 



Erdos and Hanani [11] proved that 

lim f(n, k) ■ k [ k - ]\ = 1 ; 
n-oo ^ > n (n-l) 

it follows that 

fc(fc-l) 

hmsup m(n, /e) ■ — < 1 . 

n-»oo n(n — 1) 

□ 



3 Closure-lines in hypergraphs and metric spaces 



The Sylvester- Gallai theorem [16,10,7,3,12,14,6] asserts that every noncollinear 
finite set X of points in the plane includes two points such that the line passing 
through them includes no other point of X. This theorem does not translate 
to the framework of metric spaces along the simple lines of our Section 1: 
in the five-point example of that section, every line consists of three or four 
ponts. Nevertheless, it does translate to the framework of metric spaces in a 
circuitous way, which we are about to describe. 

Let us call a set T of vertices in a hypergraph affinely closed if, and only if, 
every edge that shares at least two vertices with T is fully contained in T. For 
every set S of vertices, the intersection of all affinely closed supersets of S is an 
affinely closed set, which we will refer to as the affine closure of S and which 
we will denote by aff(S'). By closure-lines in the hypergraph, we shall mean 
all the sets aff({-u,t>}) with u and v two distinct vertices; by closure-lines in a 
metric space (X, p), we shall mean closure-lines in its associated hypergraph 
(X,H(p)). 

When X is a subset of a Euclidean space and p is the Euclidean metric, lines 
and closure-lines in (X, p) coincide: each of them is the intersection of X 
and the Euclidean line passing through two distinct points of X. One of us 
[6] conjectured and the other one [5] proved that the notion of closure-lines 
provides a translation of the Sylvester-Gallai theorem to the framework of 
metric spaces: 

In every finite metric space, some closure-line includes 
either all the points of the ground set or only two of them. 

The same notion falls far short of providing a translation of the de Bruijn - 
Erdos theorem to the framework of metric spaces: 
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Theorem 7 For every integer n greater than 5, there is a metric space on n 
points where each closure-line consists of at most n — 2 points and there are 
precisely 7 distinct closure-lines altogether. 



PROOF. Consider the metric space (X, p), where X = {x k : 1 < k < n} 
with 



a* = (1,3), x 2 = (2,4), x 3 = (3,l), x 4 =(4,2), 
x k = (k,n + 5 — k) whenever 5 < k < n, 

and 

p((a 1 ,a 2 ), (h,b 2 )) = |a x - &i| + |a 2 - b 2 \. 

Since H(p) consists of all {xi, x 2l x k } with 5 < k < n, all {x^, X4, x k } with 
5 < k < n, and all {xi, Xj,x k } with 5 < i < j < k < n, we have 



aff ({xi, x 2 }) = X — {x 3 , x^}, 
aff ({x 3 , x 4 }) = X - {x 1 ,x 2 }, 

aff ({xi, Xj}) = X — {xi, x 2 , X3, X4} whenever 5 < i < j < n, 
aff ({xi, Xj}) = {xi, Xj} whenever f < i < 2 and 3 < j < 4, 
aff ({xi, Xj}) = X — {x 3 , X4} whenever f < i < 2 and 5 < j < n, 
aff ({xi, Xj}) = X — {xi, x 2 } whenever 3 < i < 4 and 5 < j < n. 



□ 



Finally, let m(n, k) denote the smallest number of closure-lines in a hypergraph 
on n vertices where every closure-line consists of at most k vertices. Our proof 
of Theorem 5 with "lines" replaced by "closure-lines" shows that 

. n(n - 1) 

whenever n > k > 2; in turn, our proof of Theorem 6 with "lines" replaced 
by "closure-lines" yields the following conclusion. 

Theorem 8 

hm m(n,k) ■ — = 1 

whenever k > 2. 

The order of growth of m(n, k) is given by its lower bound (7): 
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Theorem 9 There is a positive constant c such that 

n(n — 1) . , . n(n — 1) 

; < m(n,k) < c v 



k(k-l) ~ v ' ' ~ k(k-l) 
whenever n > k > 2. 



PROOF. For every integer k greater than 1, Theorem 8 guarantees the ex- 
istence of a constant c k such that 

n(n — 1 ) 

m(n, k) < Ck ■ -rj-. —- whenever n> k. (8) 

k(k — 1) 

With c any constant such that 

c > 12 and c > c k whenever 2 < k < 12, 
we propose to show that, for every integer k greater than 1, 

_. . n(n — 1) , . . 

m(n, k) < c • — — — whenever n > k. (9) 

K[K 1 I 

(Trivially, m(n,k) = 1 whenever 2 < n < k.) For this purpose, consider an 
arbitrary but fixed integer k greater than 1. If k < 12, then (9) follows from 
(8); if k > 12, then we will use induction on n to prove that m(n, k) < cn 2 /k 2 
whenever n > k. 

Set 

[n+1 
P = 2 — 
and note for a future reference that 

'n+1 \ 4n 



/n + 1 \ An 
4< p<2 (_ + 1 )<_. 



Take a set X such that \X\ = n, take a subset X of X such that \X \ = p— 1, 
and partition X — X into pairwise disjoint sets (1 < i < p) whose sizes are 
as nearly equal as possible. Since 

k _ w - (p - 1) < k _ 
4 p - 2 

we have 

fc — 1 

2 < min | V{ \ < max | Vi \ < — - — . 

In some hypergraph (X , H ), every closure-line consists of at most k vertices 
and there are precisely m{p— 1, k) distinct closure-lines altogether. A theorem 
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of Behzad, Chartrand, and Cooper, Jr. [1] guarantees that (the chromatic 
index of the complete graph K 2s is 2s — 1, and so) there is a mapping 

<j>:{S: Sc{l,2,...,p}, \S\ = 2}^X 

with the following property: 

for every % in {1,2, ... ,p} and for every w in Xo 

there is precisely one j in {1, 2, . . . ,p} such that <p({i,j}) = w. 

Set 



Hi = {{u, v, w} : there are % and j with u G Vi, v G Vj, <p({i,j}) = w}, 
H 2 = {S : \S\ — 3 and there is an % with S C V^}, 

and H = H U HiU H 2 . Since closure-lines in hypergraph (X, iJ) are 

• all the closure-lines in hypergraph (X ,H ), 

• all the sets Vi U Vj U j})} sucn ^at 1 < i < j < p, and 

• all the sets Vi such that 1 < i < p, 



we have 



m(n, k) < m(p — 1, k) + K J + p . 



v 2 y 

If p — 1 > k, then (as p — 1 < n/3) the induction hypothesis guarantees that 



m(p-l,«0<c('^) <|(|) 2 ; 



if p — 1 < k, then 
finally, 

We conclude that 
□ 



c /n N 2 



m( J> -l,k) = l<-[- k ) ; 



c f n\ 2 ( n\ 2 n 2 
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